TS-Net: OCR Trained to Switch Between Text Transcription Styles
نویسندگان
چکیده
Users of OCR systems, from different institutions and scientific disciplines, prefer produce transcription styles. This presents a problem for training consistent text recognition neural networks on real-world data. We propose to extend existing with Transcription Style Block (TSB) which can learn data switch between multiple styles without any explicit knowledge rules. TSB is an adaptive instance normalization conditioned by identifiers representing consistently transcribed documents (e.g. single document, transcriber, or institution). show that able completely in controlled experiments artificial data, it improves accuracy large-scale learns semantically meaningful style embeddings. also how efficiently adapt new transcriptions only few lines.
منابع مشابه
Evaluating OCR and Non - OCR Text
In literature, many feature types and learning algorithms are proposed for document classiication. However , an extensive and systematic evaluation of the various approaches has not been done yet. In order to investigate diierent text representations for document classiication, we have developed a tool which transforms documents into feature-value representations suitable for standard learning ...
متن کاملText Pre-processing and Text Segmentation for OCR
Optical Character Recognition (OCR) systems have been effectively developed for the recognition of printed script. The accuracy of OCR system mainly depends on the text preprocessing and segmentation algorithm being used. When the document is scanned it can be placed in any arbitrary angle which would appear on the computer monitor at the same angle. This paper addresses the algorithm for corre...
متن کاملA complete OCR for printed Tamil text
A Neural Network approach is proposed to build an automatic off-line handwritten Tamil character recognition system. We have used a Back Propagation Network (BPN) as a character recognizer. Once trained, the network has a very fast response time. However, the learning phase of this recognizer is a relatively difficult task in this application. The input image of the handwritten character is giv...
متن کاملImaged Document Text Retrieval Without OCR
ÐWe propose a method for text retrieval from document images without the use of OCR. Documents are segmented into character objects. Image features, namely, the Vertical Traverse Density (VTD) and Horizontal Traverse Density (HTD), are extracted. An n-gram based document vector is constructed for each document based on these features. Text similarity between documents is then measured by calcul...
متن کاملFuzzy Full-Text Searches in OCR Databases
Though the quality of optical character recognition software is steadily improving, it is still far from being perfect. As a result, full-text databases that are lled by means of OCR software contain many errors. These errors have to be taken into consideration if such kind of databases are examined by means of full-text searches. In this chapter, we will illustrate some of the possible methods...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-86337-1_32